Video content analysis or video content analytics ( VCA), also known as video analysis or video analytics ( VA), is the capability of automatically analyzing video to detect and determine temporal and spatial events.
This technical capability is used in a wide range of domains including entertainment, KINECT , add-on peripheral for the Xbox 360 console video retrieval and video browsing,Dimitrova, Nevenka, et al. " Applications of video-content analysis and retrieval." IEEE multimedia 9.3 (2002): 42-55. health-care, retail, automotive, transport, home automation, flame and smoke detection, safety, and security. VCA usage increase in British Security , BSIA report The can be implemented as software on general-purpose machines, or as hardware in specialized video processing units.
Many different functionalities can be implemented in VCA. Video Motion Detection is one of the simpler forms where motion is detected with regard to a fixed background scene. More advanced functionalities include video trackingCavaliere, Danilo, Vincenzo Loia, and Sabrina Senatore. " Towards an ontology design pattern for UAV video content analysis." IEEE Access 7 (2019): 105342-105353. and egomotion estimation.
Based on the internal representation that VCA generates in the machine, it is possible to build other functionalities, such as video summarization,Ma, Yu-Fei, et al. " A user attention model for video summarization." Proceedings of the tenth ACM international conference on Multimedia. 2002. identification, behavior analysis, or other forms of situation awareness.
VCA relies on good input video, so it is often combined with video enhancement technologies such as video denoising, image stabilization, unsharp masking, and super-resolution.
Blocking a part of the video signal based on the signal itself, for example because of privacy concerns. |
IP cameras with intelligent video surveillance technology can be used to detect flame and smoke in 15–20 seconds or even less because of the built-in DSP chip. The chip processes algorithms that analyzes the videos captured for flame and smoke characteristics such as color chrominance, flickering ratio, shape, pattern and moving direction. |
Egomotion estimation is used to determine the location of a camera by analyzing its output signal. |
Motion detection is used to determine the presence of relevant motion in the observed scene. |
Shape recognition is used to recognize shapes in the input video, for example circles or squares. This functionality is typically used in more advanced functionalities such as object detection. |
Object detection is used to determine the presence of a type of object or entity, for example a person or car. Other examples include fire and smoke detection. |
Face recognition and Automatic Number Plate Recognition are used to recognize, and therefore possibly identify, persons or cars. |
Style detection is used in settings where the video signal has been produced, for example for television broadcast. Style detection detects the style of the production process. Style detection , Cees G.M. Snoek et al., Detection of TV news monologues by style analysis, ICME'04 |
Tamper detection is used to determine whether the camera or output signal is tampered with. |
Video tracking is used to determine the location of persons or objects in the video signal, possibly with regard to an external reference grid. |
Video scene content tamper analysis using free software. Video Error level analysis (VELA) |
Joint object discovery, classification and segmentation of targets in one or multiple related video sequences |
In many domains VCA is implemented on CCTV systems, either distributed on the cameras (at-the-edge) or centralized on dedicated processing systems. Video Analytics and Smart CCTV are commercial terms for VCA in the security domain. In the UK the BSIA has developed an introduction guide for VCA in the security domain. British Industry VCA Guide , 262 An Introduction to Video Content Analysis Industry Guide In addition to video analytics and to complement it, audio analytics can also be used.Audio Analytic
Video management software manufacturers are constantly expanding the range of the video analytics modules available. With the new suspect tracking technology, it is then possible to track all of this subject's movements easily: where they came from, and when, where, and how they moved. Within a particular surveillance system, the indexing technology is able to locate people with similar features who were within the cameras’ viewpoints during or within a specific period of time. Usually, the system finds a lot of different people with similar features and presents them in the form of snapshots. The operator only needs to click on those images and subjects which need to be tracked. Within a minute or so, it's possible to track all the movements of a particular person, and even to create a step-by-step video of the movements.
Kinect is an add-on peripheral for the Xbox 360 gaming console that uses VCA for part of the user input.
In retail industry, VCA is used to track shoppers inside the store. By this way, a heatmap of the store can be obtained, which is beneficial for store design and marketing optimisations. Other applications include dwell time when looking at a products and item removed/left detection.
The quality of VCA in the commercial setting is difficult to determine. It depends on many variables such as use case, implementation, system configuration and computing platform. Typical methods to get an objective idea of the quality in commercial settings include independent benchmarking i-Lids, Benchmarking initiative by the UK Home Office and designated test locations.
VCA has been used for Crowd control purposes, notably at The O2 Arena in London and The London Eye.
The EU is funding a FP7 project called P-REACT P-REACT Project Website to integrate video content analytics on embedded systems with police and transport security databases.
|
|